# Spectrogram Transformer
Ssast Small Patch Audioset 16 16
Bsd-3-clause
Audio classification model pre-trained on AudioSet and Librispeech, using vision transformer architecture to process audio spectrograms
Audio Classification
Transformers

S
Simon-Kotchou
2,408
1
Ast Finetuned Audioset 14 14 0.443
Bsd-3-clause
An audio spectrogram transformer fine-tuned on the AudioSet dataset, which converts audio into spectrograms and processes them using a vision transformer architecture, achieving excellent performance in audio classification tasks.
Audio Classification
Transformers

A
MIT
194.20k
5
Featured Recommended AI Models